Tracking Morphophonemic Transformation in Arabic Word Generation and Root Extraction
نویسندگان
چکیده
Performing root-based searching, concordancing, and grammar checking in Arabic requires an efficient method for matching stems with roots and vice versa. Such mapping is complicated by the hundreds of manifestations of the same root; the radicals often undergo replacement, fusion, inversion, and/or deletion. It is a challenge, therefore, to keep track of original radicals. An algorithm based on methods used by native speakers is proposed here to track root radicals in the generation process and the subsequent reversal process of root extraction. Verb roots are classified by the types of their radicals and the stems they generate. Roots are molded with morphosemantic and morphosyntactic patterns to generate stems modified for tense, voice, and mode, affixed for different subject number, gender, and person. The surface forms of applicable morphophonemic transformation are then derived using finite state machines. This paper defines what is meant by `stem', describes a stem generation engine that the authors developed, and outlines how a generated stem database is compiled for all Arabic verbs.
منابع مشابه
Towards a new Approach for Arabic root extraction: Exploit relations between the word letters and their placement in the word for Arabic root extraction
This paper presents a new root-extraction approach for Arabic words. The approach tries to assign for Arabic words a unique root without relying on a database of word roots, a list of word patterns or a list of all the prefixes and the suffixes of the Arabic words. Unlike most of Arabic rule-based stemmers, it tries to predict the root-letters positions one by one based on some rules and relati...
متن کاملA Word Grammar of Turkish with Morphophonemic Rules
A WORD GRAMMAR OF TURKISH WITH MORPHOPHONEMIC RULES Oztaner, Serdar Murat M.S., Department of Computer Engineering Supervisor: Assist. Prof. Dr. Cem Boz sahin January 1996, 128 pages This thesis is about the computational morphological analysis and generation of Turkish word forms. Turkish morphological description is encoded using the two-level morphological model. This description consists ...
متن کاملMachine Learning of Phonologically Conditioned Noun Declensions For Tamil Morphological Generators
This paper presents machine learning solutions to a practical problem of Natural Language Generation (NLG), particularly the word formation in agglutinative languages like Tamil, in a supervised manner. The morphological generator is an important component of Natural Language Processing in Artificial Intelligence. It generates word forms given a root and affixes. The morphophonemic changes like...
متن کاملRule-based Approach for Arabic Root Extraction: New Rules to Directly Extract Roots of Arabic Words
Extracting word roots in Arabic language is very problematic due to the specific morphological and structural changes in the language. To address this problem, several techniques have been proposed. This paper continues the problem of identifying and exploiting relationship amongst Arabic letters for Arabic root extraction begun in [1]. Eight different rules that detect the root letters accordi...
متن کاملSystematic Verb Stem Generation For Arabic
Performing root-based searching, concordancing, and grammar checking in Arabic requires an efficient method for matching stems with roots and vice versa. Such mapping is complicated by the hundreds of manifestations of the same root. An algorithm based on the generation method used by native speakers is proposed here to provide a mapping from roots to stems. Verb roots are classified by the typ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Int. Arab J. Inf. Technol.
دوره 4 شماره
صفحات -
تاریخ انتشار 2007